Non-Redundant Rare Itemset Generation
نویسندگان
چکیده
Rare itemsets are likely to be of great interest because they often relate to high-impact transactions which may give rise to rules of great practical significance. Research into the rare association rule mining problem has gained momentum in the recent past. In this paper, we propose a novel approach that captures such rare rules while ensuring that redundant rules are eliminated. Extensive testing on real-world datasets from the UCI repository confirm that our approach outperforms both the Apriori-Inverse(Koh et al. 2006) and Relative Support (Yun et al. 2003) algorithms.
منابع مشابه
Mining Non- Redundant Frequent Pattern in Taxonomy Datasets using Concept Lattices
In general frequent itemsets are generated from large data sets by applying various association rule mining algorithms, these produce many redundant frequent itemsets. In this paper we proposed a new framework for Non-redundant frequent itemset generation using closed frequent itemsets without lose of information on Taxonomy Datasets using concept lattices. General Terms Frequent Pattern, Assoc...
متن کاملClosed Itemset Mining and Non-redundant Association Rule Mining
DEFINITION Let I be a set of binary-valued attributes, called items. A set X ⊆ I is called an itemset. A transaction database D is a multiset of itemsets, where each itemset, called a transaction, has a unique identifier, called a tid. The support of an itemset X in a dataset D, denoted sup(X), is the fraction of transactions in D where X appears as a subset. X is said to be a frequent itemset ...
متن کاملA lattice-based approach for mining most generalization association rules
Traditional association rules consist of some redundant information. Some variants based on support and confidence measures such as non-redundant rules and minimal non-redundant rules were thus proposed to reduce the redundant information. In the past, we proposed most generalization association rules (MGARs), which were more compact than (minimal) non-redundant rules in that they considered th...
متن کاملZART: A Multifunctional Itemset Mining Algorithm
In this paper, we present platform Coron, which is a domain independent, multi-purposed data mining platform, incorporating a rich collection of data mining algorithms. One of these algorithms is a multifunctional itemset mining algorithm called Zart, which is based on the Pascal algorithm, with some additional features. In particular, Zart is able to perform the following, usually independent,...
متن کاملAn Efficient Method for Mining Frequent Weighted Closed Itemsets from Weighted Item Transaction Databases
1 Division of Data Science, Ton Duc Thang University, Ho Chi Minh, Viet Nam 4 2 Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh, Viet Nam 5 [email protected], [email protected] 6 7 Abstract: In this paper, a method for mining frequent weighed closed itemsets (FWCIs) 8 from weighted item transaction databases is proposed. The motivation for FWCIs is that 9 frequent ...
متن کامل